Using the Benford’s Law as a First Step to Assess the Quality of the Cancer Registry Data
نویسندگان
چکیده
BACKGROUND Benford's law states that the distribution of the first digit different from 0 [first significant digit (FSD)] in many collections of numbers is not uniform. The aim of this study is to evaluate whether population-based cancer incidence rates follow Benford's law, and if this can be used in their data quality check process. METHODS We sampled 43 population-based cancer registry populations (CRPs) from the Cancer Incidence in 5 Continents-volume X (CI5-X). The distribution of cancer incidence rate FSD was evaluated overall, by sex, and by CRP. Several statistics, including Pearson's coefficient of correlation and distance measures, were applied to check the adherence to the Benford's law. RESULTS In the whole dataset (146,590 incidence rates) and for each sex (70,722 male and 75,868 female incidence rates), the FSD distributions were Benford-like. The coefficient of correlation between observed and expected FSD distributions was extremely high (0.999), and the distance measures low. Considering single CRP (from 933 to 7,222 incidence rates), the results were in agreement with the Benford's law, and only a few CRPs showed possible discrepancies from it. CONCLUSION This study demonstrated for the first time that cancer incidence rates follow Benford's law. This characteristic can be used as a new, simple, and objective tool in data quality evaluation. The analyzed data had been already checked for publication in CI5-X. Therefore, their quality was expected to be good. In fact, only for a few CRPs several statistics were consistent with possible violations.
منابع مشابه
Application of Benford’s Law in Analyzing Geotechnical Data
Benford’s law predicts the frequency of the first digit of numbers met in a wide range of naturally occurring phenomena. In data sets, following Benford’s law, numbers are started with a small leading digit more often than those with a large leading digit. This law can be used as a tool for detecting fraud and abnormally in the number sets and any fabricated number sets. This can be used as an ...
متن کاملHealth systems research initiative to tackle growing road traffic injuries in India
Road traffic injuries (RTIs) are the sixth leading cause of deaths in India and about 400 deaths take place every day due to road traffic accidents. The present paper analyses the data of the India’s National Crime Record Bureau (NCRB) to assess the burden of RTI. In addition, it reports the health systems research initiated by the Indian Council of Medical Research (ICMR). As per NCRB data, in...
متن کاملPrevalence of Bladder Cancer in the Kerman Province, Southeastern Iran, using the Complete Prevalence Method
Background: Bladder cancer is the 10th most common cancer worldwide. We aimed to assess the prevalence of bladder cancer in the Kerman Province, in southeast Iran. Materials and Methods: In this cross-sectional study, we used data on 1272 patients with bladder cancer registered in the Kerman population-based cancer registry from 2014 to 2017. There were two parts of data including observed dat...
متن کاملپیشنهاد یک نظام ملی ثبت تروما برای ایران
Trauma is the fourth cause of death at all age groups with socio-economic costs which caused more than 6 million deaths in the world in 2000. Despite promising trend in improving many aspects of health case and treatment in the last decade in our country, little attention has been paid to the subject of registering trauma on an international standard. Effective practical research specially ...
متن کاملبررسی اعتماد به اطلاعات مردمی به عنوان منبعی جهت ثبت سرطان
Background and Aim: Cancer registration based on hospital information, clinically and partaclinically derived data from health centers and labs, may have some shortcomings in recording all cancer cases, especially in the developing countries. Thus, in this study we tried to assess the possibility of using public data concerning cancer incidence among their relatives as a complementary source o...
متن کامل